Overview

Brought to you by YData

Dataset statistics

Number of variables13
Number of observations1000
Missing cells95
Missing cells (%)0.7%
Duplicate rows33
Duplicate rows (%)3.3%
Total size in memory101.7 KiB
Average record size in memory104.1 B

Variable types

Text5
Numeric4
Categorical4

Alerts

Dataset has 33 (3.3%) duplicate rowsDuplicates
km_driven is highly overall correlated with yearHigh correlation
selling_price is highly overall correlated with transmission and 1 other fieldsHigh correlation
transmission is highly overall correlated with selling_priceHigh correlation
year is highly overall correlated with km_driven and 1 other fieldsHigh correlation
seller_type is highly imbalanced (52.7%) Imbalance
mileage has 19 (1.9%) missing values Missing
engine has 19 (1.9%) missing values Missing
max_power has 19 (1.9%) missing values Missing
torque has 19 (1.9%) missing values Missing
seats has 19 (1.9%) missing values Missing

Reproduction

Analysis started2024-11-19 22:42:56.588106
Analysis finished2024-11-19 22:43:09.119004
Duration12.53 seconds
Software versionydata-profiling vv4.12.0
Download configurationconfig.json

Variables

name
Text

Distinct621
Distinct (%)62.1%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
2024-11-20T01:43:09.823534image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Length

Max length49
Median length39
Mean length24.857
Min length11

Characters and Unicode

Total characters24857
Distinct characters67
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique440 ?
Unique (%)44.0%

Sample

1st rowMahindra Xylo E4 BS IV
2nd rowTata Nexon 1.5 Revotorq XE
3rd rowHonda Civic 1.8 S AT
4th rowHonda City i DTEC VX
5th rowTata Indica Vista Aura 1.2 Safire BSIV
ValueCountFrequency (%)
maruti 290
 
6.2%
hyundai 198
 
4.2%
tata 106
 
2.3%
mahindra 90
 
1.9%
diesel 83
 
1.8%
swift 83
 
1.8%
bsiv 79
 
1.7%
vxi 74
 
1.6%
1.2 71
 
1.5%
plus 64
 
1.4%
Other values (495) 3549
75.7%
2024-11-20T01:43:11.069067image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3687
 
14.8%
a 1852
 
7.5%
i 1631
 
6.6%
t 1253
 
5.0%
r 1094
 
4.4%
o 1010
 
4.1%
n 934
 
3.8%
e 890
 
3.6%
u 738
 
3.0%
S 701
 
2.8%
Other values (57) 11067
44.5%

Most occurring categories

ValueCountFrequency (%)
(unknown) 24857
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
3687
 
14.8%
a 1852
 
7.5%
i 1631
 
6.6%
t 1253
 
5.0%
r 1094
 
4.4%
o 1010
 
4.1%
n 934
 
3.8%
e 890
 
3.6%
u 738
 
3.0%
S 701
 
2.8%
Other values (57) 11067
44.5%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 24857
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
3687
 
14.8%
a 1852
 
7.5%
i 1631
 
6.6%
t 1253
 
5.0%
r 1094
 
4.4%
o 1010
 
4.1%
n 934
 
3.8%
e 890
 
3.6%
u 738
 
3.0%
S 701
 
2.8%
Other values (57) 11067
44.5%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 24857
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
3687
 
14.8%
a 1852
 
7.5%
i 1631
 
6.6%
t 1253
 
5.0%
r 1094
 
4.4%
o 1010
 
4.1%
n 934
 
3.8%
e 890
 
3.6%
u 738
 
3.0%
S 701
 
2.8%
Other values (57) 11067
44.5%

year
Real number (ℝ)

High correlation 

Distinct24
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2013.681
Minimum1995
Maximum2020
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB
2024-11-20T01:43:11.476472image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Quantile statistics

Minimum1995
5-th percentile2006
Q12011
median2014
Q32017
95-th percentile2019
Maximum2020
Range25
Interquartile range (IQR)6

Descriptive statistics

Standard deviation4.0121486
Coefficient of variation (CV)0.001992445
Kurtosis1.2158841
Mean2013.681
Median Absolute Deviation (MAD)3
Skewness-1.0223557
Sum2013681
Variance16.097336
MonotonicityNot monotonic
2024-11-20T01:43:11.988995image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
2017 134
13.4%
2016 106
10.6%
2015 96
9.6%
2018 91
9.1%
2011 85
8.5%
2012 83
8.3%
2014 79
7.9%
2013 76
7.6%
2019 64
6.4%
2010 49
 
4.9%
Other values (14) 137
13.7%
ValueCountFrequency (%)
1995 1
 
0.1%
1998 1
 
0.1%
1999 5
 
0.5%
2000 1
 
0.1%
2001 2
 
0.2%
2002 4
 
0.4%
2003 8
 
0.8%
2004 10
1.0%
2005 10
1.0%
2006 20
2.0%
ValueCountFrequency (%)
2020 4
 
0.4%
2019 64
6.4%
2018 91
9.1%
2017 134
13.4%
2016 106
10.6%
2015 96
9.6%
2014 79
7.9%
2013 76
7.6%
2012 83
8.3%
2011 85
8.5%

selling_price
Real number (ℝ)

High correlation 

Distinct274
Distinct (%)27.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean617901.04
Minimum31000
Maximum6000000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB
2024-11-20T01:43:12.558708image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Quantile statistics

Minimum31000
5-th percentile100000
Q1250000
median434999
Q3670000
95-th percentile1904049
Maximum6000000
Range5969000
Interquartile range (IQR)420000

Descriptive statistics

Standard deviation758553.86
Coefficient of variation (CV)1.22763
Kurtosis21.438457
Mean617901.04
Median Absolute Deviation (MAD)205000
Skewness4.2148309
Sum6.1790104 × 108
Variance5.7540396 × 1011
MonotonicityNot monotonic
2024-11-20T01:43:13.232821image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
300000 29
 
2.9%
350000 28
 
2.8%
600000 28
 
2.8%
550000 25
 
2.5%
400000 24
 
2.4%
650000 24
 
2.4%
250000 22
 
2.2%
750000 22
 
2.2%
500000 22
 
2.2%
450000 16
 
1.6%
Other values (264) 760
76.0%
ValueCountFrequency (%)
31000 1
 
0.1%
33983 1
 
0.1%
35000 1
 
0.1%
40000 1
 
0.1%
45000 5
0.5%
46000 1
 
0.1%
50000 2
 
0.2%
52000 2
 
0.2%
55000 3
0.3%
55599 1
 
0.1%
ValueCountFrequency (%)
6000000 2
 
0.2%
5500000 5
0.5%
5400000 2
 
0.2%
5150000 3
 
0.3%
4100000 1
 
0.1%
3800000 2
 
0.2%
3750000 1
 
0.1%
3400000 1
 
0.1%
3251000 1
 
0.1%
3200000 8
0.8%

km_driven
Real number (ℝ)

High correlation 

Distinct260
Distinct (%)26.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean71393.341
Minimum1303
Maximum375000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB
2024-11-20T01:43:13.695503image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Quantile statistics

Minimum1303
5-th percentile9190
Q137000
median61500
Q3100000
95-th percentile160000
Maximum375000
Range373697
Interquartile range (IQR)63000

Descriptive statistics

Standard deviation48486.219
Coefficient of variation (CV)0.67914203
Kurtosis3.8337561
Mean71393.341
Median Absolute Deviation (MAD)28500
Skewness1.4228571
Sum71393341
Variance2.3509134 × 109
MonotonicityNot monotonic
2024-11-20T01:43:14.080071image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
120000 66
 
6.6%
70000 58
 
5.8%
60000 55
 
5.5%
80000 54
 
5.4%
40000 46
 
4.6%
50000 44
 
4.4%
90000 38
 
3.8%
110000 35
 
3.5%
100000 33
 
3.3%
30000 27
 
2.7%
Other values (250) 544
54.4%
ValueCountFrequency (%)
1303 1
 
0.1%
2000 7
0.7%
2388 1
 
0.1%
2600 1
 
0.1%
3100 1
 
0.1%
3500 2
 
0.2%
3564 1
 
0.1%
4000 1
 
0.1%
4337 1
 
0.1%
5000 9
0.9%
ValueCountFrequency (%)
375000 1
0.1%
300000 2
0.2%
298000 1
0.1%
291000 1
0.1%
270000 1
0.1%
265000 1
0.1%
264000 1
0.1%
260000 1
0.1%
250000 1
0.1%
248000 1
0.1%

fuel
Categorical

Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
Diesel
534 
Petrol
457 
CNG
 
5
LPG
 
4

Length

Max length6
Median length6
Mean length5.973
Min length3

Characters and Unicode

Total characters5973
Distinct characters13
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowDiesel
2nd rowDiesel
3rd rowPetrol
4th rowDiesel
5th rowPetrol

Common Values

ValueCountFrequency (%)
Diesel 534
53.4%
Petrol 457
45.7%
CNG 5
 
0.5%
LPG 4
 
0.4%

Length

2024-11-20T01:43:14.555809image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-11-20T01:43:14.856979image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
ValueCountFrequency (%)
diesel 534
53.4%
petrol 457
45.7%
cng 5
 
0.5%
lpg 4
 
0.4%

Most occurring characters

ValueCountFrequency (%)
e 1525
25.5%
l 991
16.6%
D 534
 
8.9%
i 534
 
8.9%
s 534
 
8.9%
P 461
 
7.7%
t 457
 
7.7%
r 457
 
7.7%
o 457
 
7.7%
G 9
 
0.2%
Other values (3) 14
 
0.2%

Most occurring categories

ValueCountFrequency (%)
(unknown) 5973
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 1525
25.5%
l 991
16.6%
D 534
 
8.9%
i 534
 
8.9%
s 534
 
8.9%
P 461
 
7.7%
t 457
 
7.7%
r 457
 
7.7%
o 457
 
7.7%
G 9
 
0.2%
Other values (3) 14
 
0.2%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 5973
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 1525
25.5%
l 991
16.6%
D 534
 
8.9%
i 534
 
8.9%
s 534
 
8.9%
P 461
 
7.7%
t 457
 
7.7%
r 457
 
7.7%
o 457
 
7.7%
G 9
 
0.2%
Other values (3) 14
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 5973
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 1525
25.5%
l 991
16.6%
D 534
 
8.9%
i 534
 
8.9%
s 534
 
8.9%
P 461
 
7.7%
t 457
 
7.7%
r 457
 
7.7%
o 457
 
7.7%
G 9
 
0.2%
Other values (3) 14
 
0.2%

seller_type
Categorical

Imbalance 

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
Individual
837 
Dealer
135 
Trustmark Dealer
 
28

Length

Max length16
Median length10
Mean length9.628
Min length6

Characters and Unicode

Total characters9628
Distinct characters17
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowIndividual
2nd rowIndividual
3rd rowIndividual
4th rowIndividual
5th rowIndividual

Common Values

ValueCountFrequency (%)
Individual 837
83.7%
Dealer 135
 
13.5%
Trustmark Dealer 28
 
2.8%

Length

2024-11-20T01:43:15.147178image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-11-20T01:43:15.377448image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
ValueCountFrequency (%)
individual 837
81.4%
dealer 163
 
15.9%
trustmark 28
 
2.7%

Most occurring characters

ValueCountFrequency (%)
d 1674
17.4%
i 1674
17.4%
a 1028
10.7%
l 1000
10.4%
u 865
9.0%
I 837
8.7%
n 837
8.7%
v 837
8.7%
e 326
 
3.4%
r 219
 
2.3%
Other values (7) 331
 
3.4%

Most occurring categories

ValueCountFrequency (%)
(unknown) 9628
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
d 1674
17.4%
i 1674
17.4%
a 1028
10.7%
l 1000
10.4%
u 865
9.0%
I 837
8.7%
n 837
8.7%
v 837
8.7%
e 326
 
3.4%
r 219
 
2.3%
Other values (7) 331
 
3.4%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 9628
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
d 1674
17.4%
i 1674
17.4%
a 1028
10.7%
l 1000
10.4%
u 865
9.0%
I 837
8.7%
n 837
8.7%
v 837
8.7%
e 326
 
3.4%
r 219
 
2.3%
Other values (7) 331
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 9628
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
d 1674
17.4%
i 1674
17.4%
a 1028
10.7%
l 1000
10.4%
u 865
9.0%
I 837
8.7%
n 837
8.7%
v 837
8.7%
e 326
 
3.4%
r 219
 
2.3%
Other values (7) 331
 
3.4%

transmission
Categorical

High correlation 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
Manual
877 
Automatic
123 

Length

Max length9
Median length6
Mean length6.369
Min length6

Characters and Unicode

Total characters6369
Distinct characters11
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowManual
2nd rowManual
3rd rowAutomatic
4th rowManual
5th rowManual

Common Values

ValueCountFrequency (%)
Manual 877
87.7%
Automatic 123
 
12.3%

Length

2024-11-20T01:43:15.636378image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-11-20T01:43:15.867872image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
ValueCountFrequency (%)
manual 877
87.7%
automatic 123
 
12.3%

Most occurring characters

ValueCountFrequency (%)
a 1877
29.5%
u 1000
15.7%
M 877
13.8%
n 877
13.8%
l 877
13.8%
t 246
 
3.9%
A 123
 
1.9%
o 123
 
1.9%
m 123
 
1.9%
i 123
 
1.9%

Most occurring categories

ValueCountFrequency (%)
(unknown) 6369
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a 1877
29.5%
u 1000
15.7%
M 877
13.8%
n 877
13.8%
l 877
13.8%
t 246
 
3.9%
A 123
 
1.9%
o 123
 
1.9%
m 123
 
1.9%
i 123
 
1.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 6369
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a 1877
29.5%
u 1000
15.7%
M 877
13.8%
n 877
13.8%
l 877
13.8%
t 246
 
3.9%
A 123
 
1.9%
o 123
 
1.9%
m 123
 
1.9%
i 123
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 6369
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a 1877
29.5%
u 1000
15.7%
M 877
13.8%
n 877
13.8%
l 877
13.8%
t 246
 
3.9%
A 123
 
1.9%
o 123
 
1.9%
m 123
 
1.9%
i 123
 
1.9%

owner
Categorical

Distinct5
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
First Owner
623 
Second Owner
278 
Third Owner
71 
Fourth & Above Owner
 
27
Test Drive Car
 
1

Length

Max length20
Median length11
Mean length11.524
Min length11

Characters and Unicode

Total characters11524
Distinct characters24
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st rowFirst Owner
2nd rowFirst Owner
3rd rowFirst Owner
4th rowFirst Owner
5th rowSecond Owner

Common Values

ValueCountFrequency (%)
First Owner 623
62.3%
Second Owner 278
27.8%
Third Owner 71
 
7.1%
Fourth & Above Owner 27
 
2.7%
Test Drive Car 1
 
0.1%

Length

2024-11-20T01:43:16.145788image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-11-20T01:43:16.375738image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
ValueCountFrequency (%)
owner 999
48.6%
first 623
30.3%
second 278
 
13.5%
third 71
 
3.5%
fourth 27
 
1.3%
27
 
1.3%
above 27
 
1.3%
test 1
 
< 0.1%
drive 1
 
< 0.1%
car 1
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
r 1722
14.9%
e 1306
11.3%
n 1277
11.1%
1055
9.2%
O 999
8.7%
w 999
8.7%
i 695
6.0%
t 651
 
5.6%
F 650
 
5.6%
s 624
 
5.4%
Other values (14) 1546
13.4%

Most occurring categories

ValueCountFrequency (%)
(unknown) 11524
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
r 1722
14.9%
e 1306
11.3%
n 1277
11.1%
1055
9.2%
O 999
8.7%
w 999
8.7%
i 695
6.0%
t 651
 
5.6%
F 650
 
5.6%
s 624
 
5.4%
Other values (14) 1546
13.4%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 11524
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
r 1722
14.9%
e 1306
11.3%
n 1277
11.1%
1055
9.2%
O 999
8.7%
w 999
8.7%
i 695
6.0%
t 651
 
5.6%
F 650
 
5.6%
s 624
 
5.4%
Other values (14) 1546
13.4%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 11524
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
r 1722
14.9%
e 1306
11.3%
n 1277
11.1%
1055
9.2%
O 999
8.7%
w 999
8.7%
i 695
6.0%
t 651
 
5.6%
F 650
 
5.6%
s 624
 
5.4%
Other values (14) 1546
13.4%

mileage
Text

Missing 

Distinct237
Distinct (%)24.2%
Missing19
Missing (%)1.9%
Memory size7.9 KiB
2024-11-20T01:43:16.880423image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Length

Max length11
Median length9
Mean length9.4057085
Min length8

Characters and Unicode

Total characters9227
Distinct characters18
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique73 ?
Unique (%)7.4%

Sample

1st row14.0 kmpl
2nd row21.5 kmpl
3rd row12.9 kmpl
4th row25.1 kmpl
5th row16.5 kmpl
ValueCountFrequency (%)
kmpl 972
49.5%
18.6 23
 
1.2%
18.9 22
 
1.1%
21.1 22
 
1.1%
19.7 21
 
1.1%
16.1 17
 
0.9%
12.8 16
 
0.8%
17.0 16
 
0.8%
22.74 15
 
0.8%
18.2 15
 
0.8%
Other values (225) 823
41.9%
2024-11-20T01:43:17.695901image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
k 990
10.7%
. 981
10.6%
981
10.6%
m 981
10.6%
l 972
10.5%
p 972
10.5%
1 783
8.5%
2 652
7.1%
0 271
 
2.9%
5 251
 
2.7%
Other values (8) 1393
15.1%

Most occurring categories

ValueCountFrequency (%)
(unknown) 9227
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
k 990
10.7%
. 981
10.6%
981
10.6%
m 981
10.6%
l 972
10.5%
p 972
10.5%
1 783
8.5%
2 652
7.1%
0 271
 
2.9%
5 251
 
2.7%
Other values (8) 1393
15.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 9227
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
k 990
10.7%
. 981
10.6%
981
10.6%
m 981
10.6%
l 972
10.5%
p 972
10.5%
1 783
8.5%
2 652
7.1%
0 271
 
2.9%
5 251
 
2.7%
Other values (8) 1393
15.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 9227
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
k 990
10.7%
. 981
10.6%
981
10.6%
m 981
10.6%
l 972
10.5%
p 972
10.5%
1 783
8.5%
2 652
7.1%
0 271
 
2.9%
5 251
 
2.7%
Other values (8) 1393
15.1%

engine
Text

Missing 

Distinct88
Distinct (%)9.0%
Missing19
Missing (%)1.9%
Memory size7.9 KiB
2024-11-20T01:43:18.115679image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Length

Max length7
Median length7
Mean length6.8236493
Min length6

Characters and Unicode

Total characters6694
Distinct characters12
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique19 ?
Unique (%)1.9%

Sample

1st row2498 CC
2nd row1497 CC
3rd row1799 CC
4th row1498 CC
5th row1172 CC
ValueCountFrequency (%)
cc 981
50.0%
1248 116
 
5.9%
1197 105
 
5.4%
796 63
 
3.2%
998 57
 
2.9%
1396 51
 
2.6%
2179 49
 
2.5%
1498 47
 
2.4%
2494 32
 
1.6%
1199 31
 
1.6%
Other values (79) 430
21.9%
2024-11-20T01:43:18.796583image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
C 1962
29.3%
981
14.7%
1 959
14.3%
9 855
12.8%
4 386
 
5.8%
8 366
 
5.5%
2 345
 
5.2%
7 290
 
4.3%
6 223
 
3.3%
3 147
 
2.2%
Other values (2) 180
 
2.7%

Most occurring categories

ValueCountFrequency (%)
(unknown) 6694
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
C 1962
29.3%
981
14.7%
1 959
14.3%
9 855
12.8%
4 386
 
5.8%
8 366
 
5.5%
2 345
 
5.2%
7 290
 
4.3%
6 223
 
3.3%
3 147
 
2.2%
Other values (2) 180
 
2.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 6694
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
C 1962
29.3%
981
14.7%
1 959
14.3%
9 855
12.8%
4 386
 
5.8%
8 366
 
5.5%
2 345
 
5.2%
7 290
 
4.3%
6 223
 
3.3%
3 147
 
2.2%
Other values (2) 180
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 6694
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
C 1962
29.3%
981
14.7%
1 959
14.3%
9 855
12.8%
4 386
 
5.8%
8 366
 
5.5%
2 345
 
5.2%
7 290
 
4.3%
6 223
 
3.3%
3 147
 
2.2%
Other values (2) 180
 
2.7%

max_power
Text

Missing 

Distinct182
Distinct (%)18.6%
Missing19
Missing (%)1.9%
Memory size7.9 KiB
2024-11-20T01:43:19.554390image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length7.7787971
Min length6

Characters and Unicode

Total characters7631
Distinct characters15
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique55 ?
Unique (%)5.6%

Sample

1st row112 bhp
2nd row108.5 bhp
3rd row130 bhp
4th row98.6 bhp
5th row65 bhp
ValueCountFrequency (%)
bhp 981
50.0%
74 43
 
2.2%
88.5 28
 
1.4%
47.3 24
 
1.2%
81.80 24
 
1.2%
67.1 22
 
1.1%
46.3 21
 
1.1%
88.73 20
 
1.0%
88.7 20
 
1.0%
67.04 19
 
1.0%
Other values (173) 760
38.7%
2024-11-20T01:43:20.659515image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
981
12.9%
h 981
12.9%
b 981
12.9%
p 981
12.9%
. 617
8.1%
8 546
7.2%
1 448
5.9%
7 422
 
5.5%
6 308
 
4.0%
3 278
 
3.6%
Other values (5) 1088
14.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 7631
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
981
12.9%
h 981
12.9%
b 981
12.9%
p 981
12.9%
. 617
8.1%
8 546
7.2%
1 448
5.9%
7 422
 
5.5%
6 308
 
4.0%
3 278
 
3.6%
Other values (5) 1088
14.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 7631
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
981
12.9%
h 981
12.9%
b 981
12.9%
p 981
12.9%
. 617
8.1%
8 546
7.2%
1 448
5.9%
7 422
 
5.5%
6 308
 
4.0%
3 278
 
3.6%
Other values (5) 1088
14.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 7631
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
981
12.9%
h 981
12.9%
b 981
12.9%
p 981
12.9%
. 617
8.1%
8 546
7.2%
1 448
5.9%
7 422
 
5.5%
6 308
 
4.0%
3 278
 
3.6%
Other values (5) 1088
14.3%

torque
Text

Missing 

Distinct226
Distinct (%)23.0%
Missing19
Missing (%)1.9%
Memory size7.9 KiB
2024-11-20T01:43:21.251392image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Length

Max length27
Median length25
Mean length16.293578
Min length5

Characters and Unicode

Total characters15984
Distinct characters33
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique89 ?
Unique (%)9.1%

Sample

1st row260 Nm at 1800-2200 rpm
2nd row260Nm@ 1500-2750rpm
3rd row172Nm@ 4300rpm
4th row200Nm@ 1750rpm
5th row96 Nm at 3000 rpm
ValueCountFrequency (%)
4000rpm 114
 
5.5%
3500rpm 97
 
4.7%
200nm 89
 
4.3%
2000rpm 83
 
4.0%
1750rpm 69
 
3.3%
190nm 67
 
3.2%
rpm 63
 
3.0%
90nm 52
 
2.5%
2500rpm 39
 
1.9%
3000rpm 39
 
1.9%
Other values (246) 1373
65.9%
2024-11-20T01:43:22.146905image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 3273
20.5%
m 1949
12.2%
1109
 
6.9%
1 1054
 
6.6%
@ 990
 
6.2%
p 973
 
6.1%
r 973
 
6.1%
N 895
 
5.6%
2 860
 
5.4%
5 809
 
5.1%
Other values (23) 3099
19.4%

Most occurring categories

ValueCountFrequency (%)
(unknown) 15984
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
0 3273
20.5%
m 1949
12.2%
1109
 
6.9%
1 1054
 
6.6%
@ 990
 
6.2%
p 973
 
6.1%
r 973
 
6.1%
N 895
 
5.6%
2 860
 
5.4%
5 809
 
5.1%
Other values (23) 3099
19.4%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 15984
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
0 3273
20.5%
m 1949
12.2%
1109
 
6.9%
1 1054
 
6.6%
@ 990
 
6.2%
p 973
 
6.1%
r 973
 
6.1%
N 895
 
5.6%
2 860
 
5.4%
5 809
 
5.1%
Other values (23) 3099
19.4%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 15984
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
0 3273
20.5%
m 1949
12.2%
1109
 
6.9%
1 1054
 
6.6%
@ 990
 
6.2%
p 973
 
6.1%
r 973
 
6.1%
N 895
 
5.6%
2 860
 
5.4%
5 809
 
5.1%
Other values (23) 3099
19.4%

seats
Real number (ℝ)

Missing 

Distinct6
Distinct (%)0.6%
Missing19
Missing (%)1.9%
Infinite0
Infinite (%)0.0%
Mean5.4108053
Minimum4
Maximum9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB
2024-11-20T01:43:22.384840image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Quantile statistics

Minimum4
5-th percentile5
Q15
median5
Q35
95-th percentile7
Maximum9
Range5
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.91998528
Coefficient of variation (CV)0.17002742
Kurtosis1.7775742
Mean5.4108053
Median Absolute Deviation (MAD)0
Skewness1.6424577
Sum5308
Variance0.84637292
MonotonicityNot monotonic
2024-11-20T01:43:22.627716image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
5 758
75.8%
7 161
 
16.1%
4 24
 
2.4%
8 23
 
2.3%
6 8
 
0.8%
9 7
 
0.7%
(Missing) 19
 
1.9%
ValueCountFrequency (%)
4 24
 
2.4%
5 758
75.8%
6 8
 
0.8%
7 161
 
16.1%
8 23
 
2.3%
9 7
 
0.7%
ValueCountFrequency (%)
9 7
 
0.7%
8 23
 
2.3%
7 161
 
16.1%
6 8
 
0.8%
5 758
75.8%
4 24
 
2.4%

Interactions

2024-11-20T01:43:06.169361image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
2024-11-20T01:42:58.439001image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
2024-11-20T01:43:02.082923image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
2024-11-20T01:43:04.148569image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
2024-11-20T01:43:06.460542image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
2024-11-20T01:42:59.077355image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
2024-11-20T01:43:02.477937image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
2024-11-20T01:43:04.626896image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
2024-11-20T01:43:06.829977image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
2024-11-20T01:43:00.157560image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
2024-11-20T01:43:03.082329image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
2024-11-20T01:43:05.346644image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
2024-11-20T01:43:07.136955image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
2024-11-20T01:43:01.270879image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
2024-11-20T01:43:03.648276image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
2024-11-20T01:43:05.777645image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Correlations

2024-11-20T01:43:22.845016image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
fuelkm_drivenownerseatsseller_typeselling_pricetransmissionyear
fuel1.0000.1740.0000.2200.1060.1500.0000.133
km_driven0.1741.0000.1640.2460.142-0.3280.243-0.597
owner0.0000.1641.0000.0630.1740.1650.1470.281
seats0.2200.2460.0631.0000.0280.2910.0390.016
seller_type0.1060.1420.1740.0281.0000.3640.3620.196
selling_price0.150-0.3280.1650.2910.3641.0000.6280.710
transmission0.0000.2430.1470.0390.3620.6281.0000.308
year0.133-0.5970.2810.0160.1960.7100.3081.000

Missing values

2024-11-20T01:43:07.727641image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-11-20T01:43:08.375323image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-11-20T01:43:08.850762image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

nameyearselling_pricekm_drivenfuelseller_typetransmissionownermileageenginemax_powertorqueseats
0Mahindra Xylo E4 BS IV2010229999168000DieselIndividualManualFirst Owner14.0 kmpl2498 CC112 bhp260 Nm at 1800-2200 rpm7.0
1Tata Nexon 1.5 Revotorq XE201766500025000DieselIndividualManualFirst Owner21.5 kmpl1497 CC108.5 bhp260Nm@ 1500-2750rpm5.0
2Honda Civic 1.8 S AT2007175000218463PetrolIndividualAutomaticFirst Owner12.9 kmpl1799 CC130 bhp172Nm@ 4300rpm5.0
3Honda City i DTEC VX2015635000173000DieselIndividualManualFirst Owner25.1 kmpl1498 CC98.6 bhp200Nm@ 1750rpm5.0
4Tata Indica Vista Aura 1.2 Safire BSIV201113000070000PetrolIndividualManualSecond Owner16.5 kmpl1172 CC65 bhp96 Nm at 3000 rpm5.0
5Mahindra Thar CRDe201997500012584DieselDealerManualFirst Owner16.55 kmpl2498 CC105 bhp247Nm@ 1800-2000rpm6.0
6Chevrolet Spark 1.0 LS201115000035000PetrolIndividualManualFirst Owner18.0 kmpl995 CC62 bhp90.3Nm@ 4200rpm5.0
7Maruti Ritz ZXi201227500070000PetrolIndividualManualSecond Owner18.5 kmpl1197 CC85.80 bhp114Nm@ 4000rpm5.0
8Maruti Alto LX201114000072000PetrolIndividualManualSecond Owner19.7 kmpl796 CC46.3 bhp62Nm@ 3000rpm5.0
9Hyundai Creta 1.6 CRDi SX201685000058000DieselIndividualManualFirst Owner19.67 kmpl1582 CC126.2 bhp259.9Nm@ 1900-2750rpm5.0
nameyearselling_pricekm_drivenfuelseller_typetransmissionownermileageenginemax_powertorqueseats
990Maruti Alto LXi20079500070000PetrolIndividualManualSecond Owner19.7 kmpl796 CC46.3 bhp62Nm@ 3000rpm5.0
991Honda Brio V MT201237600026000PetrolIndividualManualFirst Owner19.4 kmpl1198 CC86.8 bhp109Nm@ 4500rpm5.0
992Maruti Alto LXi200685000150000PetrolIndividualManualSecond Owner19.7 kmpl796 CC46.3 bhp62Nm@ 3000rpm5.0
993Maruti 800 DX199952000100000PetrolIndividualManualFirst Owner16.1 kmpl796 CC37 bhp59Nm@ 2500rpm4.0
994Maruti Swift Dzire VXi2010240000143000PetrolIndividualManualFirst Owner17.5 kmpl1298 CC85.8 bhp114Nm@ 4000rpm5.0
995Hyundai i10 Magna 1.1L2008250000100000PetrolIndividualManualSecond Owner19.81 kmpl1086 CC68.05 bhp99.04Nm@ 4500rpm5.0
996Hyundai i20 2015-2017 Sportz 1.2201744000050000PetrolIndividualManualSecond Owner18.6 kmpl1197 CC81.83 bhp114.7Nm@ 4000rpm5.0
997Hyundai i20 Era Diesel200934000040000DieselIndividualManualFirst Owner23.0 kmpl1396 CC90 bhp22.4 kgm at 1750-2750rpm5.0
998Hyundai i10 Asta201235000025000PetrolIndividualManualFirst Owner20.36 kmpl1197 CC78.9 bhp111.8Nm@ 4000rpm5.0
999Honda City i DTec SV2016700000110000DieselIndividualManualFirst Owner26.0 kmpl1498 CC98.6 bhp200Nm@ 1750rpm5.0

Duplicate rows

Most frequently occurring

nameyearselling_pricekm_drivenfuelseller_typetransmissionownermileageenginemax_powertorqueseats# duplicates
2Honda Jazz VX201655000056494PetrolTrustmark DealerManualFirst Owner18.2 kmpl1199 CC88.7 bhp110Nm@ 4800rpm5.08
9Jaguar XF 2.0 Diesel Portfolio2017320000045000DieselDealerAutomaticFirst Owner19.33 kmpl1999 CC177 bhp430Nm@ 1750-2500rpm5.06
28Toyota Camry 2.5 Hybrid2016200000068089PetrolTrustmark DealerAutomaticFirst Owner19.16 kmpl2494 CC157.7 bhp213Nm@ 4500rpm5.06
31Volvo V40 D3 R-Design201824750002000DieselDealerAutomaticFirst Owner16.8 kmpl1984 CC150 bhp350Nm@ 1500-2750rpm5.06
1BMW X4 M Sport X xDrive20d201955000008500DieselDealerAutomaticFirst Owner16.78 kmpl1995 CC190 bhp400Nm@ 1750-2500rpm5.04
17Maruti Swift AMT VVT VXI20196500005621PetrolTrustmark DealerAutomaticFirst Owner22.0 kmpl1197 CC81.80 bhp113Nm@ 4200rpm5.04
23Skoda Rapid 1.6 MPI AT Elegance201664500011000PetrolDealerAutomaticFirst Owner14.3 kmpl1598 CC103.5 bhp153Nm@ 3800rpm5.04
25Tata Safari Storme EX2015503000110000DieselIndividualManualFirst Owner14.1 kmpl2179 CC147.94 bhp320Nm@ 1500-3000rpm7.04
4Hyundai Grand i10 1.2 CRDi Sportz201745000056290DieselDealerManualFirst Owner24.0 kmpl1186 CC73.97 bhp190.24nm@ 1750-2250rpm5.03
10Lexus ES 300h2019515000020000PetrolDealerAutomaticFirst Owner22.37 kmpl2487 CC214.56 bhp202Nm@ 3600-5200rpm5.03